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J_( Abstract 

^^ We first prove that solving Mahjong Solitaire boards with peeking is NP- 

,^ j complete, even if one only allows isolated stacks of the forms aab and abb. 

We subsequently show that layouts of isolated stacks of heights one and 
two can always be solved with peeking, and that doing so is in P, as well 
as finding an optimal algorithm for such layouts without peeking, 
i i Next, we describe a practical algorithm for solving Mahjong Solitaire 

\^J boards with peeking, which is simple and fast. The algorithm uses an 

f\ effective pruning criterion and a heuristic to find and prioritize critical 

groups. The ideas of the algorithm can also be applied to solving Shisen- 
Sho with peeking. 



Mahjong Solitaire is a game which is played with the 144 tiles of the Chinese 
— I game Mahjong. The tiles are distributed in 36 groups of four tiles each. In the 

^ beginning of the game, the tiles are stacked randomly in a predefined pattern, 

called the layout. The so-called turtle layout is used the most and therefore called 

the default layout as well. After stacking the tiles, the object is to remove all 

tiles under certain rules. These rules are as follows. 



C*~) • A tile is playable, if and only if no other tile is lying upon it, not even 

partially, and either its left side or its right side does not touch any other 
tile. 



• Only playable tiles may be played, but solely in pairs of tiles of the same 
group. Thus removing all tiles takes 72 removals of pairs of similar tiles. 
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During game play, one cannot see tiles which are completely below other tiles. 
Sometimes a tile can be seen partially, namely where it is not covered by an 
other tile. 

1 Motivation 

Since the theoretical content of this article is not enough to justify publication, 
the idea of writing this article came somewhat later and is solely motivated by 
the experience that people try to write similar algorithms for both Mahjong 
Solitaire and Shiscn-Sho, but fail to get them fast enough. Hence the theory in 
this article fulfills the main purpose of science, namely serving practice. 



The sensitivity of the performance of a program for Mahjong Solitaire to 
design is connected to the NP-completeness of the problem. For that reason, we 
include an NP-completeness result as well. Although such a result has not been 
published yet in an official forum like this, the result that Mahjong Solitaire 
with peeking is NP-complete is not new. For that reason, our result will be the 
novelty that Mahjong Solitaire with peeking is already NP-complete when the 
layout only contains isolated stacks of height three. 



2 Complexity results 

When we generalize the number of groups from 36 to any natural number, we 
get a game which can be subjected to complexity analysis. The following results 
are known. 

Theorem 1 (A. Condon, J. Feigenbaum, C. Lund, and P. Shor, [2} Theorem 
3.6]). Mahjong Solitaire is PSPACE-complete. 

Theorem 2 (D. Eppstein, [3J Shanghai]). Mahjong Solitaire is NP-complete 
when peeking is allowed. 

We only consider the variant with peeking of Mahjong Solitaire in this article. 
In order to be able to include a complexity result and to be original at the same 
time, we refine theorem [21 which results in the below theorem. 

Theorem 3. Solving Mahjong Solitaire boards with peeking is NP-complete, 
even if one only allows isolated stacks of the forms 
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(We write the tile names on the sides to make peeking easier.) 

Proof. We reduce from 3-SAT. Our construction consists of three steps. 

• Step 1: initial setup. 

We set up the following eight tile stacks. 
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For each x-group Xi, there is exactly one stack with two x^-tiles on top. 
In order to release the third tile of such a stack, it is necessary to free the 
other two Xj-tiles. This property will be maintained in the other steps. 

For each y-group yj, there is exactly one stack with two j/j-tiles below. In 
order to release tiles below one of the other two y 3 -tiles, it is necessary to 
remove the top tile of the stack with two j/j-tiles below first, since otherwise 
both J/, -tiles of this stack cannot be freed any more. This property will 
be maintained in the other steps as well. 

The SAT group is a key group. The ultimate problem (which corresponds 
to solving the instance of 3-SAT) will be to remove the first pair of the 
SAT group, which can only consist of the SAT tile below two tiles of an 
x-group and the SAT tile above the two yi-tiles, since the other two SAT- 
tiles are blocked indirectly by the SAT tile which is above the two yi-tiles. 
After removing the first SAT pair, it will be easy to clear the board. 

• Step 2: adding variables. 

For each variable Vi, we perform the following. Let m be the largest index 
of the x-groups and n be the largest index of the y-groups. Remove two 
tile stacks and add fourteen tile stacks as drawn below. 




One can show by induction that in order to release the first SAT pair, one 
must remove the V^-tile which is above two tiles of an x-group, for all i. 
Furthermore, one can only remove the tile Vi below two tiles of a y-group 
after the first SAT pair is freed, for all i. 

Thus for each i, one has to choose between the V, above two tiles of a 
ti-group and the Vi above two tiles of a /^-group, for removal along with 
the Vi-tile above two tiles of an x-group. Removing the V^-tile above two 
tiles of a <i-group first corresponds to setting Vi to true and removing the 
V^-tile above two tiles of a /,-group first corresponds to setting Vi to false. 



For each tj-group and each /^-group, there is exactly one stack with two 
tiles of the group below, just like with each y-group. This property is 
maintained in step 3. Thus the i^-groups and /^-groups act in a similar 
manner as the y-groups. 

• Step 3: adding clauses. 

For each clause Cj, we perform the following. Let m be the largest index 
of the x-groups. First, we remove one tile stack and add four tile stacks 
as drawn below. 




/,"£z;Z 


,,y 


/,"%%", 


if 


/.JSZiZ-'f, 


y 


/,-B-^f, 


,,y 


%m 


* 

if 


Cj 


3*771+1 


if/ 


X m +2 


4 

V 


•^m 


•£m-\-l 


3-771 + 1 


X m +2 


%m+2 


•£m-\-l 


Xm+2 


SAT 



Subsequently, for each of the at most three variables Vi in Cj , we do the 
following. 

— If Vi appears in a positive literal, let k be the largest index of the 
ii-groups. Remove one tile stack and add four tile stacks as drawn 
below. 
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If Vi appears in a negative literal, let k be the largest index of the 
/i-groups. Remove one tile stack and add four tile stacks as drawn 
below. 
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If we repeat adding literals to make the number of times a variable is 
treated equal to three for each clause (in case the definition of 3-SAT does 
not require exactly three literals), then for each j, all four Cj-tiles are 
used. 

One can show by induction that in order to release the first SAT pair, one 
must remove the Cj-tile which is above two tiles of an x-group, for all j. 
To do so, one of the other three Cj-tiles has to be released. 

For each literal of Cj which evaluates to true (with respect to the choice 
of the pair to be removed first in the group of the literal's variable), one 



Cj-tile can be released. Other Cj-tiles cannot be freed before removing 
the first SAT-pair. Thus the Cj-tile which is above two tiles of an x-group 
can only be released if Cj evaluates to true. 

In order to remove the first two SAT-tiles, all variables Vi must be assigned 
boolean values and all clauses Cj must be satisfied. After removing the first 
SAT-pair, all remaining V^-tiles can be released, after which all remaining Cj- 
tiles can be freed. All other tiles are removed along with this. Hence our 
reduction from 3-SAT is complete. □ 

With one level Mahjongg solitaire, all tiles are on the same level and can there- 
fore already be seen without peeking. 

Corollary 4. One level Mahjong Solitaire is NP-complete. 

Proof. Replace each stack of theorem [3] by a row of tiles, as follows. 




ZZZZZ, 


ZZZZZ 


ZZZZZ 


"ZZZZZ, 


zzzz 


ZZZZZ 


'V ""w"< 


'/' 


a 


a 


b 


c 


c 


c 


c 


( 1 


















ZZZZZ', 


/ "*" 


/ ^ 


ZZZZZ, 


zzzz 


zzzzzz, 


ZZZZZ', 


"Z\ 


a 


b 


b 


c 


c 


c 


c 


( 1 



Here, every stack of theorem [3] gets a new tile group c, of course. 
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Now that we have NP-completeness with stacks of height three, it seems natural 
to look what happens with stack of smaller heights. 

Theorem 5. If the layout only consists of isolated stacks of heights one and 
two, then one can always remove all tiles of the Mahjong Solitaire game when 
peeking is allowed. 

Without peeking, an optimal strategy for a Mahjong Solitaire layout as above 
is to repeat the following. First choose an arbitrary group which has a match, i. e. 
a pair of playable tiles. Next remove a match of that group with the maximum 
number of tiles not on the ground. 

Proof. Assume the layout only consists of isolated stacks of heights one and 
two. Since there are four tiles of each group, there cannot be a blocked cycle, see 
definition [6] below. Hence by theorem [7] below, all tiles can be removed when 
peeking is allowed. 

Without peeking, one can only remove the wrong tiles of a group when 
exactly three of the four tiles are not on the ground, in case the formulated 
strategy is obeyed. But in that case, there is no information that might lead to 
a best guess, since all three stacks of height two with a tile of the above group 
on top look the same. □ 



When playing Mahjong Solitaire, the number of tiles of each group does not 
stay equal to four, but will become two first and zero later when all tiles are 
removed. Hence we formulate the following definition. 

Definition 6. Assume the layout only consists of isolated stacks of heights one 
and two, and the number of tiles of each group is two or four. A blocked cycle 
is a subset {pi,P2, ■ ■ ■ ,Pk} of distinct groups of size two which are stacked as 
follows, where k > 1. 
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In the somewhat degenerate case k 
Pi-tiles. 



1, there is only one stack, one with two 



Notice that the board cannot be solved when a blocked cycle is present. By the 
below theorem, we see how to play with stacks of heights one and two when 
peeking is allowed. 

Theorem 7. // the layout only consists of isolated stacks of heights one and 
two, and all groups have two or four tiles, then one can remove all tiles of the 
Mahjong Solitaire game when peeking is allowed, if and only if no blocked cycle 
is present. 

Furthermore, for each group which has a match, at least one of the matches 
can be played without introducing a new blocked cycle. 

Proof. We first prove the second claim. Assume we have a layout of isolated 
stacks of heights one and two, and that some group g has a match such that 
playing this match will result in a new blocked cycle. 

Then there will be exactly one tile of g on the ground after playing that 
match. This can be counteracted by playing another match of g, except if 
exactly three tiles of g are not on the ground. But in that case, every group 
which is in the blocked cycle, except g, is already reduced to only two tiles, of 
which one is on the ground, before removing the match of g. 

Hence there is only one possible blocked cycle with tiles of g up to cyclic 
shift. Thus only one of the three tiles of g which are not on the ground is within 
a blocked cycle after playing the other two. Hence the blocked cycle can be 
broken by playing the only playable tile of g within the blocked cycle together 
with another tile of g instead. This proves the second claim of theorem [7] 

To prove the first claim, notice first that by the second claim, one can avoid 
introducing the presence of a blocked cycle as long as there is a match. Thus 
assume that there are no matches available any more. 

If more than half of the tiles are on top of a stack, then there will be a match, 
because all groups have size two at least. Furthermore, there will be a match 
when a group of size four is present, because at least half of the tiles is on top 
of a stack. 



Thus there are only stacks of height two and only groups of size two, and one 
can easily see that all stacks are contained in a blocked cycle. This completes 
the proof of theorem [7] □ 

3 A practical algorithm 

To determine whether a given board is solvable, a first idea may be to try and 
play any match and next determine the solvability recursively. If the board is 
not solvable any more after playing the first match, then try another match 
instead and test it by going into recursion again, until all possible matches are 
tried. 

But this approach is way too slow. One way to improve the performance 
is to apply cleaning operations initially (and thus implicitly in the recursion as 
well). A first cleaning operation is to play all remaining (two or four) tiles of 
a group when this is possible. Another cleaning operation is the following. If 
three tiles of a group are playable and at least one of them does not block any 
other tile, then play the other two. 

Another speedup is obtained by taking into account that two matches of 
different groups can be played in any order, and that the order does not affect 
the solvability. For this purpose, one can order the groups and require that 
matches which are playable at some stage are played in increasing order of the 
groups. Thus if one is about to play a match within a group g, all current 
matches of groups lower than g will be forbidden in the branch of the search 
tree after playing the above match within g. 

Notice that every group has six possible matches. But when playing a match 
of two tiles of a group is forbidden, it does not make sense when the other two 
tiles of that group are still played together. Thus there are only three possible 
matches to distinguish. In other words, there are three possible pairings of a 
group of four tiles, namely {{1, 2}, {3, 4}}, {{1,3}, {2, 4}} and {{1,4}, {2,3}}. 
When a match within a group g is played, a pairing of g is chosen and for 
groups of lower order than g, one pairing is marked as unallowed in case such a 
group has exactly one match. If a group of lower order than g has more than 
one match, then all pairings of that group will get forbidden, which means that 
group g is not the right group to play at the moment. 

This is more or less the algorithm of [4] . Although the algorithm is not very 
fast, it can be used to evaluate initial turtle layouts, most of them within a coffee 
break, but some of them take more than a day. Others did similar things, and 
subsequently applied low-level improvements, include writing efficient assembly 
code and searching with multiple theads. 

This is however not the best idea at this stage, because high-level improve- 
ments are far more effective and still available. We discuss two of them which 
lead to an algorithm that solves a typical board in less than a second and a 
worst-case board in about a minute. 



3.1 A pruning criterion 

The above cleaning operations can be seen as a pruning criterion, since they es- 
sentially spot groups whose removal is not problematic in any sense and branch- 
ing within such groups is eliminated. A more direct pruning criterion which can 
be tried before going into recursion is the following. 

Pruning scan. Try if you can clean the board by playing the first two tiles of 
each group simultaneously (choose any match), and playing the third and fourth 
tile individually (in case the group has four tiles). 

The rules in the pruning scan are not only more gracious than the original rules, 
but it is also impossible to play the wrong match. Therefore, if the pruning scan 
has a negative answer, then the board will not be solvable and one can search 
further without going into recursion. 

Furthermore, for groups where cleaning can be applied upon, this pruning 
criterion works as well, because such groups do not affect the result of the above 
pruning scan. But subsequent optimizations, which will be discussed in the next 
subsection, will counteract the cleaning operations. 

3.2 Critical groups 

Instead of choosing matches to play recursively, one can also choose group pair- 
ings recursively. With 36 groups, one gets a search tree of size 3 36 , since there 
are three possible pairings for each group. The advantage of a group-directed 
search tree with respect to a match- directed search tree is that one can choose 
the order of groups in a group-directed search tree in an arbitrary manner. With 
a match-directed search tree, the order of groups corresponds to the order of 
playing of the first pair of tiles of them. 

With a group-directed search tree, no tiles are actually removed, which makes 
that the prune scans will not get smaller. But a scan of a board with half of 
the tiles removed in only a factor two faster than a scan with all tiles, which is 
peanuts in this context. Furthermore, groups which are associated a pairing are 
removed corresponding to that pairing in a pruning scan and no individual tile 
removal are allowed for them, otherwise one does not progress in the search. 

To improve the effect of the pruning, we try to choose a critical group and 
prioritize it by expanding it in the search tree (on top of the current stage). 
One way to find a critical group is to choose a group that allows the minimum 
number of pairings in the pruning scan. If all unpaired groups allow all three 
pairings, then one can try to find a group whose four tiles cannot be removed 
simultaneously during a pruning scan. 

This heuristic for finding critical groups is a significant improvement of the 
algorithm, but some boards still take quite some time. For that reason, we 
present a totally different heuristic for finding critical groups, one which appears 
even better in practice since all boards seem to be solved within a reasonable 
amount of time. 



Suppose that we are either on top of the search, or that the last group which 
is assigned a pairing has a pairing greater than one. We choose the next group 
without any heuristic yet, and assign it pairing one. Next, we perform a pruning 
scan. If the pruning scan is effective, then we decide that the chosen group is 
critical, assign it pairing two (the next pairing), and go further into recursion. 

If the pruning scan is not effective, then we add another group with pairing 
one to the search tree, and repeat adding groups until we get an effective pruning 
scan (or a solution). When we hit an effective pruning scan, we assign the last 
added group pairing two just as above, but we do the following with all groups 
that we just added to the search tree with pairing one. We remove all such 
groups that are not needed for the pruning scan to be effective from the search 
tree. 

Since there might be more minimal combinations of groups that keep the 
pruning scan effective, we have to be more specific here. We remove the groups in 
backward order, starting from the last added group (which is of course necessary 
for the pruning), going up to the previous group with pairing larger than one 
(or the beginning of the search tree) , from where we started adding groups with 
pairing one. Thus we adapt the search tree backwards. For the groups that are 
removed from the search tree, we cycle tiles two, three and four either forward 
or backward, hoping that they appear critical later on due to a different first 
pairing. 

A more progressive rearrangement strategy on the search tree (based on 
maintaining a pruning) , where the progress in the ternary search tree (which is 
a ternary fraction) is maximized, appeared overdone with respect to speed. 

3.3 Random solving 

One can also try to solve the board by chosing matches randomly. This is 
likely to fail, but then one can repeat the attempt another time. In the current 
implementation of the solver, [1.2 36 J = 708 attempts are done to solve a board 
randomly (since there are 36 groups which have still four tiles initially). The 
random attempts are combined with the above cleaning heuristic, but first, a 
pruning scan on top level is performed, since some layouts are so impossible 
that almost all boards can be discarded with such a scan. 

Already twenty years ago, Ken McDonald wrote a solver which tries to solve 
boards by random match selection, see [5]. He also remarks that the difficulty of 
solvable boards can be measured by the fraction of succesful attempts of solving. 

3.4 Layouts and probabilities 

Below follow probabilities that a random board of some layout of Xmahjongg 3 
cannot be won even when peeking is allowed. All results are based on scans of 
100,000 boards, except the default layout which is based on a scan of 10,000,000 
boards. This latter scan took about 40 hours on a single thread of a Xeon 
L5420. From the other layouts, the ox took with about 110 minutes the most 



time, quickly followed by the bridge layout, which has significantly more really 
hard boards than any of the other layouts. 



Default: 2.95% 

Arena: 2.6% 
Arrow: 8.2% 
Boar: 4.7% 
Bridge: 32% 
Ceremonial: 1.8' 
Deepwell: 6.0% 
Dog: 7.3% 
Dragon: 7.5% 



Farandole: 7.9% 
Hare: 18% 
Horse: 20% 
Hourglass: 100% (all) 
Monkey: 9.9% 
Ox: 47% 

Papillon: 100% (all) 
Ram: 6.9% 



Rat: 5.1% 
Rooster: 22% 
Snake: 4.2% 
Theater: 0.62% 
Tiger: 22% 
Wedges: 4.8% 



The results for transposed layouts and the source can be found on the authors 
homepage pQ, as well as a DLL for solving and a hacked version of Xmahjongg 
3.6.1 with the solver. 
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